Abstract. We show that every function of several variables on a finite set of
$k$ elements with $n > k$ essential variables has a variable identification minor
with at least $n - k$ essential variables. This is a generalization of a theorem of
Salomaa on the essential variables of Boolean functions. We also strengthen
Salomaa's theorem by characterizing all the Boolean functions $f$ having a
variable identification minor that has just one essential variable less than $f$.



1. Introduction 

Theory of essential variables of functions has been developed by several authors 
[21 [H HU [13]. In this paper, we discuss the problem how the number of essential 
variables is affected by identification of variables (diagonalization). Salomaa [11]
proved the following two theorems: one deals with operations on arbitrary finite
sets, while the other deals specifically with Boolean functions. We denote the
number of essential variables of $f$ by $\operatorname{ess} f$.

Theorem 1. Let $A$ be a finite set with $k$ elements. For every $n \le k$, there exists
an $n$-ary operation $f$ on $A$ such that $\operatorname{ess} f = n$ and every identification of variables
produces a constant function.

Thus, in general, essential variables can be preserved when variables are identified 
only in the case that n > k. 
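
To make Theorem 1 concrete, the following Python sketch checks one possible witness for $k = n = 3$ by brute force. The function `f` (equal to 1 only on the tuple (0, 1, 2)) and the helper names `essential_vars` and `identify` are assumptions introduced here for illustration; they are not claimed to be the construction used in Salomaa's proof.

```python
from itertools import product

def essential_vars(f, n, A):
    """Return the set of indices i such that f depends on its i-th argument."""
    ess = set()
    for args in product(A, repeat=n):
        for i in range(n):
            # Try every replacement value b in position i.
            if any(f(*args) != f(*(args[:i] + (b,) + args[i + 1:])) for b in A):
                ess.add(i)
    return ess

def identify(f, i, j):
    """Return the function obtained by overwriting argument i with argument j."""
    def g(*args):
        args = list(args)
        args[i] = args[j]
        return f(*args)
    return g

A = (0, 1, 2)   # k = 3
n = 3           # n <= k

def f(*args):
    # Hypothetical witness: 1 exactly on the tuple of pairwise distinct values (0, 1, 2).
    return 1 if args == (0, 1, 2) else 0

minors = [identify(f, i, j) for i in range(n) for j in range(n) if i != j]
all_constant = all(len(essential_vars(g, n, A)) == 0 for g in minors)
print(len(essential_vars(f, n, A)), all_constant)
```

Every identification forces two arguments to be equal, so the tuple (0, 1, 2) is never reached and each minor is constant, while `f` itself depends on all three arguments.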

Theorem 2. For every Boolean function $f$ with $\operatorname{ess} f \ge 2$, there is a function $g$
obtained from $f$ by identification of variables such that $\operatorname{ess} g \ge \operatorname{ess} f - 2$.

Identification of variables together with permutation of variables and cylindrification
induces a quasi-order on operations whose relevance has been made apparent
by several authors [3], [7], [8], [9], [10], [12], [13]. In the case of Boolean functions, this
quasi-order was studied in [4], where Theorem 2 was fundamental in deriving certain
bounds on the essential arity of functions.

In this paper, we will generalize Theorem 2 to operations on arbitrary finite
sets in Theorem 3. We will also strengthen Theorem 2 on Boolean functions in
Theorem 6 by determining the Boolean functions $f$ for which there exists a function
$g$ obtained from $f$ by identification of variables such that $\operatorname{ess} g = \operatorname{ess} f - 1$.

2. Variable identification minors 

Let $A$ and $B$ be arbitrary nonempty sets. A $B$-valued function of several variables
on $A$ is a mapping $f \colon A^n \to B$ for some positive integer $n$, called the arity of $f$.



Date: February 2, 2008. 






MIGUEL COUCEIRO AND ERKKO LEHTONEN 



A-valued functions on A are called operations on A. Operations on {0, 1} are called 
Boolean functions. 

We say that the $i$-th variable is essential in $f$, or $f$ depends on $x_i$, if there are
elements $a_1, \ldots, a_n, b \in A$ such that

\[ f(a_1, \ldots, a_i, \ldots, a_n) \neq f(a_1, \ldots, a_{i-1}, b, a_{i+1}, \ldots, a_n). \]

The number of essential variables in $f$ is called the essential arity of $f$, and it is
denoted by $\operatorname{ess} f$. Thus the only functions with essential arity zero are the constant
functions.

For an $n$-ary function $f$, we say that an $m$-ary function $g$ is obtained from $f$ by
simple variable substitution if there is a mapping $\sigma \colon \{1, \ldots, n\} \to \{1, \ldots, m\}$ such
that

\[ g(x_1, \ldots, x_m) = f(x_{\sigma(1)}, \ldots, x_{\sigma(n)}). \]

In the particular case that $n = m$ and $\sigma$ is a permutation of $\{1, \ldots, n\}$, we say
that $g$ is obtained from $f$ by permutation of variables. For indices $i, j \in \{1, \ldots, n\}$,
$i \neq j$, if $x_i$ and $x_j$ are essential in $f$, then the function $f_{i \leftarrow j}$ obtained from $f$ by
the simple variable substitution

\[ f_{i \leftarrow j}(x_1, \ldots, x_n) = f(x_1, \ldots, x_{i-1}, x_j, x_{i+1}, \ldots, x_n) \]

is called a variable identification minor of $f$, obtained by identifying $x_i$ with $x_j$.
Note that $\operatorname{ess} f_{i \leftarrow j} < \operatorname{ess} f$, because $x_i$ is not essential in $f_{i \leftarrow j}$ even though it is
essential in $f$.

We define a quasiorder on the set of all $B$-valued functions of several variables
on $A$ as follows: $f \le g$ if and only if $f$ is obtained from $g$ by simple variable
substitution. If $f \le g$ and $g \le f$, we denote $f \equiv g$. If $f \le g$ but $g \not\le f$, we denote
$f < g$. It can be easily observed that if $f \le g$ then $\operatorname{ess} f \le \operatorname{ess} g$, with equality if
and only if $f \equiv g$.

For a $B$-valued function $f$ of several variables on $A$, we denote the maximum
essential arity of a variable identification minor of $f$ by

\[ \operatorname{ess}^{<} f = \max_{g < f} \operatorname{ess} g, \]

and we define the arity gap of $f$ by $\operatorname{gap} f = \operatorname{ess} f - \operatorname{ess}^{<} f$.
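
The definitions of this section can be checked by brute force on small examples. The sketch below is only an illustration: the helper names `essential_vars`, `identify` and `arity_gap` are assumptions of this example, functions are modelled as Python callables on a finite tuple `A`, and the gap is computed over the variable identification minors $f_{i \leftarrow j}$ with $x_i$, $x_j$ essential (so the input must have at least two essential variables).

```python
from itertools import product

def essential_vars(f, n, A):
    """Return the set of indices i such that f depends on its i-th argument."""
    ess = set()
    for args in product(A, repeat=n):
        for i in range(n):
            if any(f(*args) != f(*(args[:i] + (b,) + args[i + 1:])) for b in A):
                ess.add(i)
    return ess

def identify(f, i, j):
    """Return the minor f_{i<-j}: argument i is overwritten by argument j."""
    def g(*args):
        args = list(args)
        args[i] = args[j]
        return f(*args)
    return g

def arity_gap(f, n, A):
    """gap f = ess f - max ess f_{i<-j} over distinct essential i, j (needs ess f >= 2)."""
    ess = essential_vars(f, n, A)
    best = max(len(essential_vars(identify(f, i, j), n, A))
               for i in ess for j in ess if i != j)
    return len(ess) - best

def xor3(x, y, z):
    return x ^ y ^ z      # x + y + z over the two-element field

def and3(x, y, z):
    return x & y & z      # the monomial x y z

print(arity_gap(xor3, 3, (0, 1)))
print(arity_gap(and3, 3, (0, 1)))
```

Identifying two variables of `xor3` cancels both of them, so its arity gap is 2, whereas identifying two variables of `and3` loses only one variable, so its arity gap is 1.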

3. Generalization of Theorem 2

Theorem 3. Let $A$ be a finite set of $k \ge 2$ elements, and let $B$ be a set with at
least two elements. Every $B$-valued function of several variables on $A$ with $n > k$
essential variables has a variable identification minor with at least $n - k$ essential
variables.

In the proof of Theorem 3, we will make use of the following theorem due to
Salomaa [11, Theorem 1], which is a strengthening of Yablonski's [13] "fundamental
lemma".

Theorem 4. Let the function $f \colon M_1 \times \cdots \times M_n \to N$ depend essentially on all of
its $n$ variables, $n \ge 2$. Then there is an index $j$ and an element $c \in M_j$ such that
the function

\[ f(x_1, \ldots, x_{j-1}, c, x_{j+1}, \ldots, x_n) \]

depends essentially on all of its $n - 1$ variables.

We also need the following auxiliary lemma. 
Lemma 5. Let $f$ be an $n$-ary function with $\operatorname{ess} f = n > k$. Then there are indices
$1 \le i < j \le k + 1$ such that at least one of the variables $x_1, \ldots, x_{k+1}$ is essential
in $f_{i \leftarrow j}$.

Proof. Since $x_1$ is essential in $f$, there are elements $a_1, \ldots, a_n, b \in A$ such that

\[ f(a_1, a_2, \ldots, a_n) \neq f(b, a_2, \ldots, a_n). \]

Thus, by the pigeonhole principle, there are indices $1 \le i < j \le k + 1$ such that $a_i = a_j$.
If $i \neq 1$, then it is clear that $x_1$ is essential in $f_{i \leftarrow j}$. If there are no such $i$ and $j$
with $i \neq 1$, then $i = 1 < j$ and we have that $b = a_l$ for some $1 < l \le k + 1$, $l \neq j$.
For $m = 1, \ldots, n$, let $c_m = a_m$ if $m \notin \{1, j, l\}$ and let $c_m = a_1$ if $m \in \{1, j, l\}$.
Then $f(c_1, c_2, \ldots, c_n)$ is distinct from at least one of $f(a_1, a_2, \ldots, a_n)$ and
$f(b, a_2, \ldots, a_n)$. If $f(c_1, c_2, \ldots, c_n) \neq f(a_1, a_2, \ldots, a_n)$, then $x_l$ is essential in
$f_{1 \leftarrow j}$. If $f(c_1, c_2, \ldots, c_n) \neq f(b, a_2, \ldots, a_n)$, then $x_l$ is essential in $f_{1 \leftarrow l}$. □

Proof of Theorem 3. By repeated application of Theorem 4, there exist $k + 1$ constants
$c_1, \ldots, c_{k+1} \in A$ such that, after a suitable permutation of variables, the function

\[ f(c_1, \ldots, c_{k+1}, x_{k+2}, \ldots, x_n) \]

depends on all of its $n - k - 1$ variables. There are indices $1 \le i < j \le k + 1$ such
that $c_i = c_j$, and by Lemma 5 there are indices $1 \le l < m \le k + 1$ such that at least
one of the variables $x_1, \ldots, x_{k+1}$ is essential in $f_{l \leftarrow m}$. With a suitable permutation
of variables, we may assume that $i = 1$, $j = 2$, $1 \le l \le 3$, $m = l + 1$.

If one of the variables $x_1, \ldots, x_{k+1}$ is essential in $f_{1 \leftarrow 2}$, then we are done.
Otherwise we have that for all $a_{k+2}, \ldots, a_n \in A$,

\[ f(c_1, c_1, c_3, c_4, \ldots, c_{k+1}, a_{k+2}, \ldots, a_n) = f(c_3, c_3, c_3, c_4, \ldots, c_{k+1}, a_{k+2}, \ldots, a_n). \]

Thus the variables $x_{k+2}, \ldots, x_n$ are essential in $f_{2 \leftarrow 3}$. If one of the variables
$x_1, \ldots, x_{k+1}$ is essential in $f_{2 \leftarrow 3}$, then we are done. Otherwise we have that for
all $a_{k+2}, \ldots, a_n \in A$,

\[ f(c_3, c_3, c_3, c_4, \ldots, c_{k+1}, a_{k+2}, \ldots, a_n) = f(c_3, c_4, c_4, c_4, \ldots, c_{k+1}, a_{k+2}, \ldots, a_n), \]

and so the variables $x_{k+2}, \ldots, x_n$ are essential in $f_{3 \leftarrow 4}$ and also at least one of
$x_1, \ldots, x_{k+1}$ is essential in $f_{3 \leftarrow 4}$. □

We would like to remark that our proof is considerably simpler than Salomaa's
original proof of Theorem 2.

4. Strengthening of Theorem 2

It is well known that every Boolean function $f$ is represented by a unique
multilinear polynomial over the two-element field. Such a representation is called the
Zhegalkin polynomial of $f$. It is clear that a variable is essential in $f$ if and only
if it occurs in the Zhegalkin polynomial of $f$. We denote by $\deg p$ the degree of
polynomial $p$. If $p$ is the Zhegalkin polynomial of $f$, then we denote the Zhegalkin
polynomial of $f_{i \leftarrow j}$ by $p_{i \leftarrow j}$. Note that the only polynomials of degree 0 are the
constant polynomials.
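
As an illustration of these notions, the Zhegalkin polynomial of a Boolean function can be computed from its truth table by the binary Möbius transform. The sketch below is a minimal example under assumptions of its own: the function name `zhegalkin`, the encoding of a monomial as a `frozenset` of variable indices (the empty set standing for the constant 1), and the sample functions `maj` and `or2` are all hypothetical choices made for this example.

```python
def zhegalkin(f, n):
    """Return the monomials of the Zhegalkin polynomial of an n-ary Boolean f.

    Each monomial is a frozenset of variable indices; frozenset() is the constant 1.
    """
    # Truth table indexed by bitmask: bit i of the mask is the value of x_i.
    coeff = [f(*[(mask >> i) & 1 for i in range(n)]) & 1 for mask in range(1 << n)]
    # In-place Moebius transform over GF(2): XOR each entry with the entry
    # obtained by clearing bit i, one variable at a time.
    for i in range(n):
        for mask in range(1 << n):
            if (mask >> i) & 1:
                coeff[mask] ^= coeff[mask ^ (1 << i)]
    return {frozenset(v for v in range(n) if (mask >> v) & 1)
            for mask in range(1 << n) if coeff[mask]}

def degree(monomials):
    """Degree of the polynomial (0 for constants, including the zero polynomial)."""
    return max((len(m) for m in monomials), default=0)

def maj(x, y, z):
    return (x & y) | (x & z) | (y & z)

def or2(x, y):
    return x | y

maj_poly = zhegalkin(maj, 3)   # x0 x1 + x0 x2 + x1 x2
or_poly = zhegalkin(or2, 2)    # x0 + x1 + x0 x1
print(sorted(sorted(m) for m in maj_poly), degree(maj_poly))
print(sorted(sorted(m) for m in or_poly), degree(or_poly))
```

The variables occurring in the returned monomials are exactly the essential variables of the function, in line with the remark above.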

Theorem 6. Let $f$ be a Boolean function with at least 2 essential variables. Then
the arity gap of $f$ is 2 if and only if the Zhegalkin polynomial of $f$ is of one of the
following special forms:

• $x_{i_1} + x_{i_2} + \cdots + x_{i_n} + c$,

• $x_i x_j + x_i + c$,

• $x_i x_j + x_i x_k + x_j x_k + c$,

• $x_i x_j + x_i x_k + x_j x_k + x_i + x_j + c$,

where $c \in \{0, 1\}$. Otherwise the arity gap of $f$ is 1.
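
For small arities, the statement of Theorem 6 can be confirmed by exhaustive search. The following sketch is only a sanity check under assumptions of this example, not part of the proof: Boolean functions are encoded as truth tables indexed by bitmasks, and the helper names (`anf`, `essential`, `identify`, `arity_gap`, `is_special`, `theorem6_holds`) are hypothetical. It enumerates all Boolean functions of $n$ variables with at least two essential variables and compares the arity gap with the shape of the Zhegalkin polynomial.

```python
def anf(table, n):
    """Zhegalkin polynomial of a truth table, as a set of monomials (frozensets)."""
    c = list(table)
    for i in range(n):
        for m in range(1 << n):
            if (m >> i) & 1:
                c[m] ^= c[m ^ (1 << i)]
    return {frozenset(v for v in range(n) if (m >> v) & 1)
            for m in range(1 << n) if c[m]}

def essential(table, n):
    """Indices of the essential variables of the function given by the table."""
    return {i for i in range(n)
            if any(table[m] != table[m ^ (1 << i)] for m in range(1 << n))}

def identify(table, n, i, j):
    """Truth table of f_{i<-j}: bit i of the input is overwritten by bit j."""
    return [table[(m & ~(1 << i)) | (((m >> j) & 1) << i)] for m in range(1 << n)]

def arity_gap(table, n):
    ess = essential(table, n)
    best = max(len(essential(identify(table, n, i, j), n))
               for i in ess for j in ess if i != j)
    return len(ess) - best

def is_special(monomials):
    """True if the polynomial has one of the four forms listed in Theorem 6."""
    terms = {m for m in monomials if m}          # ignore the constant c
    if any(len(m) > 2 for m in terms):
        return False
    quad = [m for m in terms if len(m) == 2]
    lin = [m for m in terms if len(m) == 1]
    nvars = len(set().union(*terms))
    if not quad:
        return True                              # sum of variables + c
    if nvars == 2:
        return len(lin) == 1                     # x_i x_j + x_i + c
    if nvars == 3:
        return len(quad) == 3 and len(lin) in (0, 2)
    return False

def theorem6_holds(n):
    """Check the equivalence for every n-ary table with at least 2 essential variables."""
    for code in range(1 << (1 << n)):
        table = [(code >> m) & 1 for m in range(1 << n)]
        if len(essential(table, n)) < 2:
            continue
        gap = arity_gap(table, n)
        if gap not in (1, 2) or (gap == 2) != is_special(anf(table, n)):
            return False
    return True

print(theorem6_holds(2), theorem6_holds(3))
```

The search is restricted to $n \le 3$ to keep the running time negligible; the same code can be called with $n = 4$ at the cost of enumerating 65536 truth tables.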

We prove first an auxiliary lemma that takes care of the functions of essential 
arity at least 4 whose Zhegalkin polynomial has degree 2. 

Lemma 7. If $f$ is a Boolean function with at least four essential variables and the
Zhegalkin polynomial of $f$ has degree two, then the arity gap of $f$ is one.

Proof. Denote the Zhegalkin polynomial of $f$ by $p$. We need to consider several
cases and subcases.

Case 1. Assume first that $p$ is of the form

\[ p = x_i x_j + x_i x_k + x_j x_k + x_i a_i + x_j a_j + x_k a_k + a, \]

where $a_i$, $a_j$, $a_k$ are polynomials of degree at most 1 and $a$ is a polynomial of degree
at most 2 such that there are no occurrences of variables $x_i$, $x_j$, $x_k$ in $a_i$, $a_j$, $a_k$, $a$.

Subcase 1.1. Assume that $\deg a_i = \deg a_j = \deg a_k = 0$. Then $a$ contains a
variable $x_l$ distinct from $x_i$, $x_j$, $x_k$, and we can write $a = x_l a' + a''$, where $a'$ and
$a''$ do not contain $x_l$. Then $f_{l \leftarrow i}$ is represented by the polynomial

\[ p_{l \leftarrow i} = x_i x_j + x_i x_k + x_j x_k + x_i a' + a'', \]

where all essential variables of $f$ except for $x_l$ occur, because no terms cancel, and
hence $\operatorname{gap} f = 1$.

Subcase 1.2. Assume that at least one of $a_i$, $a_j$, $a_k$ has degree 1, say $\deg a_i = 1$.
Then $a_i$ contains a variable $x_l$ distinct from $x_i$, $x_j$, $x_k$, and so $a_i = x_l + a_i'$, where
$a_i'$ has degree at most 1 and does not contain $x_l$. Consider

\[ p_{j \leftarrow k} = x_k (1 + a_j + a_k) + x_i a_i + a. \]

If all essential variables of $f$ except for $x_j$ occur in $p_{j \leftarrow k}$, then $\operatorname{gap} f = 1$ and we
are done. Otherwise we need to analyze three different subcases.

Subcase 1.2.1. Assume that variable $x_k$ occurs in $p_{j \leftarrow k}$ but there is a variable $x_t$
that occurs in $a_j$ and $a_k$ but not in $a_i$ nor in $a$ such that $x_t$ does not occur in $p_{j \leftarrow k}$
(due to some cancelling terms in $a_j$ and $a_k$). Write $a_j = x_t + a_j'$, $a_k = x_t + a_k'$, and
consider

\[
\begin{aligned}
p_{j \leftarrow t} &= x_i x_t + x_i x_k + x_t x_k + x_i a_i + x_t + x_t a_j' + x_k x_t + x_k a_k' + a \\
&= x_i x_t + x_i x_k + x_i a_i + x_t + x_t a_j' + x_k a_k' + a.
\end{aligned}
\]

Every essential variable of $f$ except for $x_j$ occurs in $p_{j \leftarrow t}$, and hence $\operatorname{gap} f = 1$.

Subcase 1.2.2. Assume that $x_k$ does not occur in $p_{j \leftarrow k}$. In this case $a_j = a_k + 1$.
Consider

\[ p_{j \leftarrow i} = x_i (1 + a_i + a_j) + x_k a_k + a. \]

If any term of $a_j$ is cancelled by a term of $a_i$, it still remains as a term of $a_k$, and
hence all variables occurring in $a_i$, $a_j$, $a_k$ occur in $p_{j \leftarrow i}$. If both $x_i$ and $x_k$ also
occur in $p_{j \leftarrow i}$, then all essential variables of $f$ except for $x_j$ occur in $p_{j \leftarrow i}$, and so
$\operatorname{gap} f = 1$.

If $x_k$ does not occur in $p_{j \leftarrow i}$, then $a_k = 0$ and so $a_j = 1$. Then

\[ p_{l \leftarrow i} = x_i x_j + x_i x_k + x_j x_k + x_i + x_i a_i' + x_j + a, \]

and every essential variable of $f$ except for $x_l$ occurs in $p_{l \leftarrow i}$. Thus $\operatorname{gap} f = 1$.

If $x_i$ does not occur in $p_{j \leftarrow i}$, then $a_j = a_i + 1$, and hence $a_i = a_k$. Consider then

\[ p_{i \leftarrow k} = x_k (1 + a_i + a_k) + x_j a_j + a = x_k + x_j a_j + a. \]

Again all essential variables of $f$ except for $x_i$ occur in $p_{i \leftarrow k}$, and so $\operatorname{gap} f = 1$.

Subcase 1.2.3. Assume that both $x_i$ and $x_k$ occur in $p_{j \leftarrow k}$ but there is a variable
$x_t$ occurring in $a_i$ and in $a_j$ but not in $a_k$ nor in $a$ such that $x_t$ does not occur in
$p_{j \leftarrow k}$ (due to some cancelling terms in $a_i$ and $a_j$). Write $a_i = x_t + a_i'$, $a_j = x_t + a_j'$,
and consider

\[
\begin{aligned}
p_{j \leftarrow t} &= x_i x_t + x_i x_k + x_t x_k + x_i x_t + x_i a_i' + x_t + x_t a_j' + x_k a_k + a \\
&= x_i x_k + x_t x_k + x_i a_i' + x_t + x_t a_j' + x_k a_k + a.
\end{aligned}
\]

Every essential variable of $f$ except for $x_j$ occurs in $p_{j \leftarrow t}$, and so $\operatorname{gap} f = 1$.

Case 2. Assume then that $p$ is of the form

\[ p = x_i x_j + x_i x_k a_{ik} + x_i a_i + x_j a_j + x_k a_k + a, \]

where $a_{ik}$ is a polynomial of degree 0; $a_i$, $a_j$, $a_k$ are polynomials of degree at most
1; and $a$ is a polynomial of degree at most 2 such that variables $x_i$, $x_j$, $x_k$ do not
occur in $a_{ik}$, $a_i$, $a_j$, $a_k$, $a$. Note that $a_{ik}$ and $a_k$ cannot both be 0, for otherwise $x_k$
would not occur in $p$. Consider

\[ p_{j \leftarrow i} = x_i (1 + a_i + a_j) + x_i x_k a_{ik} + x_k a_k + a. \]

By the above observation that $a_{ik}$ and $a_k$ are not both 0, $x_k$ occurs in $p_{j \leftarrow i}$. If
all essential variables of $f$ except for $x_j$ occur in $p_{j \leftarrow i}$, then $\operatorname{gap} f = 1$ and we are
done. Otherwise we distinguish between two cases.

Subcase 2.1. Assume that $x_i$ does not occur in $p_{j \leftarrow i}$. In this case $a_j = a_i + 1$,
$a_{ik} = 0$, and $a_k \neq 0$. Consider

\[
\begin{aligned}
p_{i \leftarrow k} &= x_j x_k + x_k a_{ik} + x_k a_i + x_j a_j + x_k a_k + a \\
&= x_j x_k + x_k (a_i + a_k) + x_j + x_j a_i + a.
\end{aligned}
\]

Both $x_j$ and $x_k$ occur in $p_{i \leftarrow k}$ because the term $x_j x_k$ cannot be cancelled. If any
term of $a_i$ is cancelled by a term of $a_k$, it still remains in $x_j a_i$. Thus, all essential
variables of $f$ except for $x_i$ occur in $p_{i \leftarrow k}$, and hence $\operatorname{gap} f = 1$.

Subcase 2.2. Assume that $x_i$ occurs in $p_{j \leftarrow i}$ but there is a variable $x_l$ occurring
in $a_i$ and $a_j$ but not in $a_{ik}$, $a_k$, nor in $a$ such that $x_l$ does not occur in $p_{j \leftarrow i}$ (due
to some cancelling terms in $a_i$ and $a_j$). Consider

\[ p_{k \leftarrow l} = x_i x_j + x_i x_l a_{ik} + x_i a_i + x_j a_j + x_l a_k + a. \]

If $a_{ik} = 1$, then the terms $x_i x_l$ in $x_i a_i$ and in $x_i x_l a_{ik}$ cancel each other. These are
the only terms that may be cancelled out. Nevertheless, $x_l$ occurs also in $a_j$, and
so all essential variables of $f$ except for $x_k$ occur in $p_{k \leftarrow l}$. Therefore $\operatorname{gap} f = 1$ also
in this case. □

Proof of Theorem 6. Denote the Zhegalkin polynomial of $f$ by $p$. It is
straightforward to verify that if $p$ has one of the special forms listed in the statement of
the theorem, then $f$ does not have a variable identification minor of essential arity
$\operatorname{ess} f - 1$ but it has one of essential arity $\operatorname{ess} f - 2$. For the converse implication,
we will prove by induction on $\operatorname{ess} f$ that if $p$ is not of any of the special forms, then
there is a variable identification minor $g$ of $f$ such that $\operatorname{ess} g = \operatorname{ess} f - 1$, i.e., $f$ has
arity gap 1.

If $\operatorname{ess} f = 2$ and $p$ is not of any of the special forms, then $p = x_i x_j + c$ or
$p = x_i x_j + x_i + x_j + c$ where $c \in \{0, 1\}$, and in both cases $p_{j \leftarrow i} = x_i + c$. In this
case $\operatorname{gap} f = 1$.

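If $\operatorname{ess} f = 3$, then $p$ has one of the following forms

\[
\begin{aligned}
& x_i x_j x_k + x_i x_j + x_i x_k + x_j x_k + a_i x_i + a_j x_j + a_k x_k + c, \\
& x_i x_j x_k + x_i x_k + x_j x_k + a_i x_i + a_j x_j + a_k x_k + c, \\
& x_i x_j x_k + x_i x_j + a_i x_i + a_j x_j + a_k x_k + c, \\
& x_i x_j + x_i x_k + x_j x_k + x_k + c, \\
& x_i x_j + x_i x_k + x_j x_k + x_i + x_j + x_k + c, \\
& x_i x_j + x_i x_k + a_i x_i + a_j x_j + a_k x_k + c, \\
& x_j x_k + a_i x_i + a_j x_j + a_k x_k + c,
\end{aligned}
\]

where $a_i, a_j, a_k, c \in \{0, 1\}$. It is easy to verify that in each case $p_{j \leftarrow i}$ contains the
term $x_i x_k$ or, in the fourth and fifth forms, equals $x_i + x_k + c$; hence both $x_i$ and $x_k$
are essential in $f_{j \leftarrow i}$, and so $\operatorname{gap} f = 1$.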

For the sake of induction, assume then that the claim holds for $2 \le \operatorname{ess} f < n$,
$n \ge 4$. Consider the case that $\operatorname{ess} f = n$. Since the case where $\deg p = 1$ is ruled
out by the assumption that $p$ does not have any of the special forms and the case
where $\deg p = 2$ is settled by Lemma 7, we can assume that $\deg p \ge 3$. Choose a
variable $x_m$ from a term of the highest possible degree in $p$, and write

\[ p = x_m \mathfrak{q} + \mathfrak{r}, \]

where the polynomials $\mathfrak{q}$ and $\mathfrak{r}$ do not contain $x_m$. We clearly have that $\deg \mathfrak{q} =
\deg p - 1$, and $\mathfrak{q}$ and $\mathfrak{r}$ represent functions with less than $n$ essential variables. Of
course, every essential variable of $f$ except for $x_m$ occurs in $\mathfrak{q}$ or $\mathfrak{r}$. We have three
different cases to consider, depending on the comparability under inclusion of the
sets of variables occurring in $\mathfrak{q}$ and $\mathfrak{r}$.

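Case 1. Assume that there is a variable $x_i$ that occurs in $\mathfrak{q}$ but does not occur
in $\mathfrak{r}$, and there is a variable $x_j$ that occurs in $\mathfrak{r}$ but does not occur in $\mathfrak{q}$. Write

\[ \mathfrak{q} = x_i \mathfrak{q}' + \mathfrak{q}'', \qquad \mathfrak{r} = x_j \mathfrak{r}' + \mathfrak{r}'', \]

where $\mathfrak{q}'$, $\mathfrak{q}''$, $\mathfrak{r}'$, $\mathfrak{r}''$ do not contain $x_i$, $x_j$. Then

\[ p = x_m x_i \mathfrak{q}' + x_m \mathfrak{q}'' + x_j \mathfrak{r}' + \mathfrak{r}'', \]

and we have that

\[ p_{j \leftarrow i} = x_m x_i \mathfrak{q}' + x_m \mathfrak{q}'' + x_i \mathfrak{r}' + \mathfrak{r}'', \]

where no terms can cancel. Hence all essential variables of $f$ except for $x_j$ are
essential in $f_{j \leftarrow i}$, and so $\operatorname{gap} f = 1$.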

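Case 2. Assume that every variable occurring in $\mathfrak{r}$ occurs in $\mathfrak{q}$. In this case $\mathfrak{q}$
represents a function $q$ of essential arity $\operatorname{ess} f - 1$, containing all essential variables
of $f$ except for $x_m$. We also have that $\deg \mathfrak{q} = \deg p - 1 \ge 2$.

Subcase 2.1. If $\operatorname{ess} f \ge 5$, then $\operatorname{ess} q \ge 4$, and we can apply the inductive
hypothesis, which tells us that there are variables $x_i$ and $x_j$ such that $\operatorname{ess} q_{i \leftarrow j} =
\operatorname{ess} q - 1$. Hence $f_{i \leftarrow j}$ is represented by the polynomial $p_{i \leftarrow j} = x_m \mathfrak{q}_{i \leftarrow j} + \mathfrak{r}_{i \leftarrow j}$, and
all essential variables of $f$ except for $x_i$ occur in $p_{i \leftarrow j}$, since no terms can cancel
between $x_m \mathfrak{q}_{i \leftarrow j}$ and $\mathfrak{r}_{i \leftarrow j}$. Thus $\operatorname{gap} f = 1$.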

Subcase 2.2. If $\operatorname{ess} f = 4$, then $\operatorname{ess} q = 3$, and we can apply the inductive
hypothesis as above unless $\mathfrak{q} = x_i x_j + x_i x_k + x_j x_k + c$ or $\mathfrak{q} = x_i x_j + x_i x_k + x_j x_k +
x_i + x_j + c$. If this is the case, consider first the case where $\mathfrak{q}$ contains a variable
$x_l \in \{x_i, x_j, x_k\}$ that does not occur in $\mathfrak{r}$. Consider then

\[ p_{m \leftarrow l} = x_l \mathfrak{q} + \mathfrak{r}. \]

Then $x_l \mathfrak{q}$ contains the term $x_i x_j x_k$, which cannot be cancelled. Namely, all other
terms of $x_l \mathfrak{q}$ have degree at most 2, and since there are at most two variables
occurring in $\mathfrak{r}$, the terms of $\mathfrak{r}$ also have degree at most 2. Thus, all variables of $f$
except for $x_m$ occur in $p_{m \leftarrow l}$, and so the arity gap of $f$ is 1.

Consider then the case that $\mathfrak{q}$ and $\mathfrak{r}$ contain the same variables, i.e., $x_i$, $x_j$, $x_k$.
If $\deg \mathfrak{r} \le 2$, then it is easily seen that $p_{m \leftarrow i}$ contains the term $x_i x_j x_k$, and all
essential variables of $f$ except for $x_m$ are essential in $f_{m \leftarrow i}$. Otherwise, we can
apply the inductive hypothesis on the function $r$ represented by $\mathfrak{r}$ and we obtain
variables $x_\alpha$ and $x_\beta$ such that $\operatorname{ess} r_{\alpha \leftarrow \beta} = \operatorname{ess} r - 1$. It can be easily verified that no
identification of variables brings $\mathfrak{q}$ into the zero polynomial, so $x_m$ and two other
variables will occur in $p_{\alpha \leftarrow \beta} = x_m \mathfrak{q}_{\alpha \leftarrow \beta} + \mathfrak{r}_{\alpha \leftarrow \beta}$. We have that $\operatorname{gap} f = 1$ also in
this case.

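Case 3. Assume that every variable occurring in $\mathfrak{q}$ occurs in $\mathfrak{r}$ but there is a
variable $x_l$ that occurs in $\mathfrak{r}$ but does not occur in $\mathfrak{q}$. If $\deg \mathfrak{r} = 1$, then $\mathfrak{r} = x_l + \mathfrak{r}'$
where $\mathfrak{r}'$ does not contain $x_l$. Then $p_{m \leftarrow l} = x_l \mathfrak{q} + x_l + \mathfrak{r}'$, where the only term that
may cancel out is $x_l$, and this happens if $\mathfrak{q}$ has a constant term 1. Nevertheless, $x_l$
occurs in $p_{m \leftarrow l}$ because $\deg \mathfrak{q} \ge 2$. Of course, all other essential variables of $f$ except
for $x_m$ also occur in $p_{m \leftarrow l}$, so $\operatorname{gap} f = 1$. We may thus assume that $\deg \mathfrak{r} \ge 2$.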

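Subcase 3.1. Assume first that $\operatorname{ess} f = 4$ (in which case $\mathfrak{r}$ contains three variables
and $\mathfrak{q}$ contains at most two variables) and $\mathfrak{r} = x_i x_j + x_i x_k + x_j x_k + c$ or $\mathfrak{r} =
x_i x_j + x_i x_k + x_j x_k + x_i + x_j + c$. Since we assume that $\deg p \ge 3$, we have
that $\deg \mathfrak{q} \ge 2$ and hence $\mathfrak{q}$ contains at least two variables. Thus exactly two
variables occur in $\mathfrak{q}$ and so also $\deg \mathfrak{q} = 2$. Then $\mathfrak{q} = x_\alpha x_\beta + b_1 x_\alpha + b_2 x_\beta + d$ where
$\alpha, \beta \in \{i, j, k\}$ and $b_1, b_2, d \in \{0, 1\}$. Let $\gamma \in \{i, j, k\} \setminus \{\alpha, \beta\}$. Then $p_{m \leftarrow \gamma}$ contains
the term $x_i x_j x_k$, and hence all essential variables of $f$ except for $x_m$ occur in $p_{m \leftarrow \gamma}$,
and so $\operatorname{gap} f = 1$.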

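Subcase 3.2. Assume then that $\operatorname{ess} f > 4$ or $\operatorname{ess} f = 4$ but $\mathfrak{r}$ does not have any
of the special forms. In this case we can apply the inductive hypothesis on the
function $r$ represented by $\mathfrak{r}$. Let $x_i$ and $x_j$ be such that $\operatorname{ess} r_{j \leftarrow i} = \operatorname{ess} r - 1$. If
$\mathfrak{q}_{j \leftarrow i} \neq 0$, then $x_m$ and all other essential variables of $f$ except for $x_j$ occur in $p_{j \leftarrow i}$,
and we are done: the arity gap of $f$ is 1. We may thus assume that $\mathfrak{q}_{j \leftarrow i} = 0$.
Write $\mathfrak{q}$ and $\mathfrak{r}$ in the form

\[
\begin{aligned}
\mathfrak{q} &= x_i x_j a_1 + x_i a_2 + x_j a_3 + a_4, \\
\mathfrak{r} &= x_i x_j b_1 + x_i b_2 + x_j b_3 + b_4,
\end{aligned}
\]

where the polynomials $a_1, a_2, a_3, a_4, b_1, b_2, b_3, b_4$ do not contain $x_i$, $x_j$. Define
the polynomials $\mathfrak{q}_1, \ldots, \mathfrak{q}_7$ as follows (cf. the proof of Theorem 4 in Salomaa [11]):
$\mathfrak{q}_1$ consists of the terms common to $a_1$, $a_2$, and $a_3$.
$\mathfrak{q}_s$, $s = 2, 3$, consists of those terms common to $a_1$ and $a_s$ which are not in $\mathfrak{q}_1$.
$\mathfrak{q}_4$ consists of those terms common to $a_2$ and $a_3$ which are not in $\mathfrak{q}_1$.
$\mathfrak{q}_{4+s}$, $s = 1, 2, 3$, consists of the remaining terms in $a_s$.

Define the polynomials $\mathfrak{r}_1, \ldots, \mathfrak{r}_7$ similarly in terms of the $b_s$'s. Note that
for any $s \neq s'$, $\mathfrak{q}_s$ and $\mathfrak{q}_{s'}$ do not have any terms in common, and similarly $\mathfrak{r}_s$ and $\mathfrak{r}_{s'}$
do not have any terms in common. Hence,

\[
\begin{aligned}
\mathfrak{q} &= x_i x_j (\mathfrak{q}_1 + \mathfrak{q}_2 + \mathfrak{q}_3 + \mathfrak{q}_5) + x_i (\mathfrak{q}_1 + \mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{q}_6) + x_j (\mathfrak{q}_1 + \mathfrak{q}_3 + \mathfrak{q}_4 + \mathfrak{q}_7) + a_4, \\
\mathfrak{r} &= x_i x_j (\mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_3 + \mathfrak{r}_5) + x_i (\mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_4 + \mathfrak{r}_6) + x_j (\mathfrak{r}_1 + \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_7) + b_4.
\end{aligned}
\]

Identification of $x_i$ with $x_j$ yields

\[
\begin{aligned}
\mathfrak{q}_{j \leftarrow i} &= x_i (\mathfrak{q}_1 + \mathfrak{q}_5 + \mathfrak{q}_6 + \mathfrak{q}_7) + a_4, \\
\mathfrak{r}_{j \leftarrow i} &= x_i (\mathfrak{r}_1 + \mathfrak{r}_5 + \mathfrak{r}_6 + \mathfrak{r}_7) + b_4.
\end{aligned}
\]

Since we are assuming that $\mathfrak{q}_{j \leftarrow i} = 0$, we have that $\mathfrak{q}_1 = \mathfrak{q}_5 = \mathfrak{q}_6 = \mathfrak{q}_7 = a_4 = 0$.
On the other hand, $\mathfrak{q} \neq 0$, so $\mathfrak{q}_2$, $\mathfrak{q}_3$, $\mathfrak{q}_4$ are not all zero. Thus

\[ \mathfrak{q} = x_i x_j (\mathfrak{q}_2 + \mathfrak{q}_3) + x_i (\mathfrak{q}_2 + \mathfrak{q}_4) + x_j (\mathfrak{q}_3 + \mathfrak{q}_4). \]

All essential variables of $f$ except for $x_m$ and $x_j$ are contained in $\mathfrak{r}_{j \leftarrow i}$.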

Subcase 3.2.1. Assume that there is a variable $x_t$ occurring in $b_4$ that does not
occur in $\mathfrak{r}_1$, $\mathfrak{r}_5$, $\mathfrak{r}_6$, $\mathfrak{r}_7$. Consider

\[ p_{m \leftarrow t} = x_t \mathfrak{q} + \mathfrak{r} = x_t \mathfrak{q} + x_i x_j b_1 + x_i b_2 + x_j b_3 + b_4. \]

Cancelling may only happen between a term of $x_t \mathfrak{q}$ and a term of $\mathfrak{r}$. No term of $b_4$
can be cancelled, because every term of $x_t \mathfrak{q}$ contains $x_i$ or $x_j$ but the terms of $b_4$
do not contain either. The variables that do not occur in $b_4$ occur in some terms
of $b_1$, $b_2$, $b_3$ that do not contain $x_t$. Thus, all essential variables of $f$ except for $x_m$
occur in $p_{m \leftarrow t}$, and so in this case $f$ has arity gap 1.

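Subcase 3.2.2. Assume that all variables of $\mathfrak{r}$ except for $x_i$, $x_j$ occur already in
$\mathfrak{r}_1 + \mathfrak{r}_5 + \mathfrak{r}_6 + \mathfrak{r}_7$. Consider

\[
\begin{aligned}
p_{m \leftarrow i} ={}& x_i x_j (\mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_3 + \mathfrak{r}_5) + {} \\
& x_i (\mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_4 + \mathfrak{r}_6) + {} \\
& x_j (\mathfrak{r}_1 + \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_7) + b_4.
\end{aligned}
\tag{1}
\]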

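Subcase 3.2.2.1. Assume first that $x_i$ does not occur in $p_{m \leftarrow i}$ in (1). Then

\[
\begin{aligned}
\mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_3 + \mathfrak{r}_5 &= 0, \\
\mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_4 + \mathfrak{r}_6 &= 0,
\end{aligned}
\]

and since the $\mathfrak{r}_s$'s do not have terms in common, we have that

\[ \mathfrak{r}_1 + \mathfrak{r}_2 = \mathfrak{q}_2 + \mathfrak{q}_4, \qquad \mathfrak{r}_3 = \mathfrak{r}_4 = \mathfrak{r}_5 = \mathfrak{r}_6 = 0. \]

Then all variables of $\mathfrak{r}$ except for $x_i$, $x_j$ occur already in $\mathfrak{r}_1 + \mathfrak{r}_7$. Consider

\[
\begin{aligned}
p_{m \leftarrow j} ={}& x_i x_j (\mathfrak{q}_3 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_3 + \mathfrak{r}_5) + {} \\
& x_i (\mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_4 + \mathfrak{r}_6) + {} \\
& x_j (\mathfrak{q}_3 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_7) + b_4 \\
={}& x_i x_j (\mathfrak{q}_2 + \mathfrak{q}_3) + {} \\
& x_i (\mathfrak{r}_1 + \mathfrak{r}_2) + {} \\
& x_j (\mathfrak{q}_2 + \mathfrak{q}_3 + \mathfrak{r}_2 + \mathfrak{r}_7) + b_4.
\end{aligned}
\tag{2}
\]

All variables of $\mathfrak{r}_1$ are there on the fifth line of (2). If a term of $\mathfrak{r}_7$ is cancelled
by a term of $\mathfrak{q}_2 + \mathfrak{q}_3$ on the sixth line, it still remains on the fourth line, so all
variables of $\mathfrak{r}_7$ are also there. We still need to verify that the variables $x_i$ and $x_j$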



THE EFFECT OF VARIABLE IDENTIFICATION ON ESSENTIAL ARITY 






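are not cancelled out from (2). If $\mathfrak{q}_2 + \mathfrak{q}_3 \neq 0$ then we are done. Assume then that
$\mathfrak{q}_2 + \mathfrak{q}_3 = 0$, in which case $\mathfrak{q}_4 \neq 0$. Since

\[ \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_4 + \mathfrak{r}_6 = \mathfrak{r}_1 + \mathfrak{r}_2 = \mathfrak{q}_2 + \mathfrak{q}_4 = \mathfrak{q}_4 \neq 0, \]

we have $x_i$ in (2). Since

\[ \mathfrak{q}_3 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_7 = \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_7 \]

and $\mathfrak{r}_1 + \mathfrak{r}_7$ contains all variables of $\mathfrak{r}$ except for $x_i$, $x_j$, but $\mathfrak{q}_4$ does not, $\mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_7 \neq 0$,
so we also have $x_j$ in (2). Thus, the arity gap of $f$ equals 1 in this case.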

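Subcase 3.2.2.2. Assume then that $x_i$ occurs in $p_{m \leftarrow i}$ in (1). Nothing cancels
out on the third line of (1), and therefore the variables of $\mathfrak{r}_1$ and $\mathfrak{r}_7$ occur in $p_{m \leftarrow i}$.
Terms of $\mathfrak{r}_5$ may be cancelled out by terms of $\mathfrak{q}_2 + \mathfrak{q}_4$ on the first line of (1)
but such terms will remain on the second line. Thus the variables of $\mathfrak{r}_5$ occur in
$p_{m \leftarrow i}$. A similar argument shows that the variables of $\mathfrak{r}_6$ also occur in $p_{m \leftarrow i}$. In
order for $f$ to have arity gap 1, we still need to verify that $x_j$ occurs in $p_{m \leftarrow i}$. If
$\mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_3 + \mathfrak{r}_5 \neq 0$, then we are done. We may thus assume that

\[ \mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_3 + \mathfrak{r}_5 = 0. \tag{3} \]

By the assumption that $x_i$ occurs in $p_{m \leftarrow i}$, the second line of (1) does not vanish,
i.e.,

\[ 0 \neq \mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_4 + \mathfrak{r}_6 = \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_5 + \mathfrak{r}_6. \]

If the third line of (1) does not vanish either, i.e., $\mathfrak{r}_1 + \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_7 \neq 0$, then we have
both $x_i$ and $x_j$ and we are done. We may thus assume that $\mathfrak{r}_1 + \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_7 = 0$,
i.e., $\mathfrak{r}_1 = \mathfrak{r}_3 = \mathfrak{r}_4 = \mathfrak{r}_7 = 0$. Then all variables of $\mathfrak{r}$ except for $x_i$, $x_j$ occur already
in $\mathfrak{r}_5 + \mathfrak{r}_6$. Equation (3) implies that $\mathfrak{r}_2 + \mathfrak{r}_5 = \mathfrak{q}_2 + \mathfrak{q}_4$. Consider

\[
\begin{aligned}
p_{m \leftarrow j} ={}& x_i x_j (\mathfrak{q}_3 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_3 + \mathfrak{r}_5) + {} \\
& x_i (\mathfrak{r}_1 + \mathfrak{r}_2 + \mathfrak{r}_4 + \mathfrak{r}_6) + {} \\
& x_j (\mathfrak{q}_3 + \mathfrak{q}_4 + \mathfrak{r}_1 + \mathfrak{r}_3 + \mathfrak{r}_4 + \mathfrak{r}_7) + b_4 \\
={}& x_i x_j (\mathfrak{q}_2 + \mathfrak{q}_3) + {} \\
& x_i (\mathfrak{q}_2 + \mathfrak{q}_4 + \mathfrak{r}_5 + \mathfrak{r}_6) + {} \\
& x_j (\mathfrak{q}_3 + \mathfrak{q}_4) + b_4.
\end{aligned}
\tag{4}
\]

In what follows, the first, second and third lines of (4) refer to the three lines of the
simplified expression after the second equality sign.

Assume first that $\mathfrak{q}_2 + \mathfrak{q}_3 = 0$, in which case $\mathfrak{q}_4 \neq 0$. If a term of $\mathfrak{r}_5 + \mathfrak{r}_6$ is cancelled
by a term of $\mathfrak{q}_4$ on the second line of (4), it will still remain on the third line.
Therefore we have in $p_{m \leftarrow j}$ all variables of $\mathfrak{r}$ except for $x_i$ and $x_j$. Since $\mathfrak{r}_5 + \mathfrak{r}_6$
contains all variables of $\mathfrak{r}$ except for $x_i$, $x_j$ but $\mathfrak{q}_2 + \mathfrak{q}_4 = \mathfrak{q}_4$ does not, the second line
of (4) does not vanish, and so we have $x_i$. We also have $x_j$ because $\mathfrak{q}_3 + \mathfrak{q}_4 = \mathfrak{q}_4 \neq 0$
on the third line. In this case $f$ has arity gap 1.

Assume then that $\mathfrak{q}_2 + \mathfrak{q}_3 \neq 0$. Then the first line of (4) does not vanish and
both $x_i$ and $x_j$ occur in $p_{m \leftarrow j}$. If any term of $\mathfrak{r}_5 + \mathfrak{r}_6$ is cancelled by a term of $\mathfrak{q}_2$
on the second line of (4), it still remains on the first line, and if it is cancelled by a
term of $\mathfrak{q}_4$, it remains on the third line. Thus all variables of $\mathfrak{r}$ occur in $p_{m \leftarrow j}$, and
$f$ has arity gap 1 again. This completes the proof of Theorem 6. □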

5. Concluding remarks 

We do not know whether the upper bound on arity gap given by Theorem 3 is
sharp. For base sets $A$ with $k \ge 3$ elements, we do not know whether there exists
an operation $f$ on $A$ with $\operatorname{ess} f \ge k + 1$ and $\operatorname{gap} f \ge 3$. We know that for all $k \ge 2$,
there are operations on a $k$-element set $A$ with arity gap 2. Consider for instance
the quasi-linear functions of Burle [2]. A function $f$ is quasi-linear if it has the
form

\[ f = g(h_1(x_1) \oplus h_2(x_2) \oplus \cdots \oplus h_n(x_n)), \]

where $h_1, \ldots, h_n \colon A \to \{0, 1\}$, $g \colon \{0, 1\} \to A$ are arbitrary mappings and $\oplus$ denotes
addition modulo 2. It is easy to verify that if those $h_i$'s that are nonconstant
coincide (and $g$ is not a constant map), then $f$ has arity gap 2.
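
The quasi-linear construction can be tried out numerically. The sketch below is an illustration only: it fixes a 3-element set, a single nonconstant map `h` used for every variable, and a nonconstant map `g`, all of which are arbitrary choices made for this example, and it computes the arity gap by brute force with hypothetical helper functions.

```python
from itertools import product

def essential_vars(f, n, A):
    """Return the set of indices i such that f depends on its i-th argument."""
    ess = set()
    for args in product(A, repeat=n):
        for i in range(n):
            if any(f(*args) != f(*(args[:i] + (b,) + args[i + 1:])) for b in A):
                ess.add(i)
    return ess

def identify(f, i, j):
    """Return the minor f_{i<-j}: argument i is overwritten by argument j."""
    def g(*args):
        args = list(args)
        args[i] = args[j]
        return f(*args)
    return g

def arity_gap(f, n, A):
    """ess f minus the largest essential arity of a minor f_{i<-j} (needs ess f >= 2)."""
    ess = essential_vars(f, n, A)
    best = max(len(essential_vars(identify(f, i, j), n, A))
               for i in ess for j in ess if i != j)
    return len(ess) - best

A = (0, 1, 2)                      # k = 3

def h(x):
    return 1 if x == 0 else 0      # nonconstant map A -> {0, 1}, shared by all variables

def g(bit):
    return 2 if bit else 0         # nonconstant map {0, 1} -> A

def quasi_linear(*xs):
    parity = 0
    for x in xs:
        parity ^= h(x)             # addition modulo 2
    return g(parity)

print(len(essential_vars(quasi_linear, 4, A)), arity_gap(quasi_linear, 4, A))
```

Identifying two variables makes their contributions $h(x_j) \oplus h(x_j)$ cancel, so each minor depends on exactly two fewer variables.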

In general, if there is an operation $f$ on a $k$-element set $A$ with $\operatorname{gap} f = m$,
then there are operations of arity gap $m$ on all sets $B$ of at least $k$ elements. Namely,
it is easy to see that any operation $g$ on $B$ of the form

\[ g = \varphi(f(\gamma(x_1), \gamma(x_2), \ldots, \gamma(x_n))), \]

where $\gamma \colon B \to A$ is surjective and $\varphi \colon A \to B$ is injective, satisfies $\operatorname{ess} g = \operatorname{ess} f$ and
$\operatorname{gap} g = \operatorname{gap} f$.
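
This lifting can also be checked on a small instance. In the sketch below, the Boolean function `f` (a sum of three variables modulo 2), the surjection `gamma` and the injection `phi` are arbitrary choices made for illustration, and the helper functions are hypothetical names introduced for this example.

```python
from itertools import product

def essential_vars(f, n, A):
    """Return the set of indices i such that f depends on its i-th argument."""
    ess = set()
    for args in product(A, repeat=n):
        for i in range(n):
            if any(f(*args) != f(*(args[:i] + (b,) + args[i + 1:])) for b in A):
                ess.add(i)
    return ess

def identify(f, i, j):
    """Return the minor f_{i<-j}: argument i is overwritten by argument j."""
    def g(*args):
        args = list(args)
        args[i] = args[j]
        return f(*args)
    return g

def arity_gap(f, n, A):
    """ess f minus the largest essential arity of a minor f_{i<-j} (needs ess f >= 2)."""
    ess = essential_vars(f, n, A)
    best = max(len(essential_vars(identify(f, i, j), n, A))
               for i in ess for j in ess if i != j)
    return len(ess) - best

A = (0, 1)          # base set of f
B = (0, 1, 2)       # larger base set of g

def f(x, y, z):
    return x ^ y ^ z            # Boolean function with arity gap 2

def gamma(b):
    return b % 2                # surjective map B -> A

def phi(a):
    return a + 1                # injective map A -> B

def g(x, y, z):
    return phi(f(gamma(x), gamma(y), gamma(z)))

print(len(essential_vars(f, 3, A)), arity_gap(f, 3, A))
print(len(essential_vars(g, 3, B)), arity_gap(g, 3, B))
```

Both lines report the same essential arity and the same arity gap, as the remark above predicts for this particular choice of `gamma` and `phi`.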